Picture for Zisu Huang

Zisu Huang

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

Add code
May 26, 2026
Viaarxiv icon

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Add code
May 25, 2026
Viaarxiv icon

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Add code
May 22, 2026
Viaarxiv icon

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Add code
Apr 15, 2026
Viaarxiv icon

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Add code
Feb 03, 2026
Viaarxiv icon

TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios

Add code
Feb 02, 2026
Viaarxiv icon

BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation

Add code
Jan 30, 2026
Viaarxiv icon

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Add code
Jan 08, 2026
Viaarxiv icon

CSSG: Measuring Code Similarity with Semantic Graphs

Add code
Jan 07, 2026
Viaarxiv icon

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Add code
Jan 07, 2026
Viaarxiv icon